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Abstract: Stop squarks with a mass just above the top's and which decay to a nearly 
massless LSP are difficult to probe because of the large SM di-top background. Here 
we discuss search strategies which could be used to set more stringent bounds in this 
difficult region. In particular, we note that both the rapidity difference Ay(t, t) and spin 
correlations (inferred from, for example, A(p(£ + ,£~)) are sensitive to the presence of stops. 
We emphasize that systematic uncertainties in top quark production can confound analyses 
looking for stops, making theoretical and experimental progress on the understanding of 
Standard Model top production at high precision a very important task. We estimate that 
spin correlation alone, which is relatively robust against such systematic uncertainties, can 
exclude a 200 GeV stop at 95% confidence with 20 ftr 1 at the 8 TeV LHC. 
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1. Introduction 

Supersymmetry remains the most compelling theoretical explanation for how the vast hier- 
archy between weak and Planck scales can persist in the face of quantum mechanical effects. 
The leading quantum corrections that make the Standard Model an unnatural theory are 
the quadratically divergent contributions of top quark loops to the Higgs boson mass term, 
which are largely canceled in supersymmetric theories by loops of scalar tops. Minimizing 
fine tuning in supersymmetric theories, then, requires that the stop mass be as close to 
the top mass as possible. From the theoretical viewpoint, this motivates consideration of 
models in which the stop may be among the lightest superpartners [1, 2]. 

The LHC is setting ever more stringent bounds on SUSY. Very roughly, if the first 
and second generation squarks are all of the same mass and they decay as q — > q + x° the 
current bounds require niq > 1.3 TeV (for decoupled gluinos), and if the gluinos decay as 
g qq + x the bound is roughly m,g > 1 TeV (for decoupled squarks). These bounds come 
primarily from searches for jets + MET [3, 4, 5, 6], and become stronger if squarks and 
gluinos have comparable masses. These results force "vanilla" supersymmetry, meaning 
the class of well-studied models with flavor universal squark masses and large amounts of 
missing transverse momentum in decays, into an uncomfortably unnatural corner. 
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However, current data allows another interesting possibility. If squarks of the third 
generation are significantly lighter than those of the first two, the bounds on the lighter 
particles can be considerably weaker [7, 8, 9, 10, 11, 12]. Sbottoms which decay as b — > 
b + x°, f° r instance, are only constrained to > 400 GeV for light LSPs, with the bounds 
becoming much less stringent as m x o is increased [13]. Perhaps the weakest bounds are 
on the stop, which after 1 fb _1 is still allowed to be at or perhaps slightly below the top 
mass, according to theoretical estimates [14, 8] that carry too much uncertainty in detector 
modeling to make a sharp claim. Third generation squarks produced indirectly in gluino 
decays lead to signatures of same-sign dileptons or of jets and MET, constraining the gluino 
to be heavier than about 850 to 950 GeV [15, 16, 17, 18], which is still compatible with 
naturalness. 1 

These considerations strongly motivate a deeper study of the phenomenology of light 
stops. The study of simplified spectra, based on naturalness considerations, in which a 
stop decays to a top and neutralino was advocated in [26] , and a number of related works 
have appeared since [27, 28, 29, 30], as well as variations in which the stop plays a role in 
coannihilation to explain dark matter abundance [31, 32] or has flavor- violating couplings to 
explain top Aps [33]. (We have recently learned of two other papers in preparation [34, 35] 
that take different approaches to the problem, which may be complementary to ours.) The 
challenge of stop phenomenology is to disentangle the stop signal from top backgrounds. 
For our purposes, it is useful to introduce a division of t — > bW + x° signals into three 
categories depending on the stop mass: 

• Three-body stops: if stops are light enough, they cannot decay to an on-shell top quark 
and a neutralino (or gravitino), so the three-body decay t — > bW + x° dominates. 
(With more extreme squeezing of the spectrum, a four-body decay or flavor-violating 
decay may result, but we will not consider this limit.) Many kinematic distributions 
such as or transverse mass for stop decay will be significantly different from those 
in top decay [36, 14]. However, this region also suffers from relatively low acceptance, 
and probably deserves a closer look in the future. 

• Two-body stops: if rrit, the decay is two-body, i — > tx°, and the invisible 
neutralino or gravitino can carry away a large amount of momentum. The simplest 
way to exploit this is to impose a hard MET cut. More sophisticated approaches use 
the fact that MET in the top background arises from neutrinos in W decay, which 
suggests the power of a transverse mass cut in the semileptonic case [27] and a cut 
on Mt2 computed with the two leptons and MET in the dileptonic case [37, 8]. A 
recent update on this regime appeared in Ref. [38]. 

• Stealth ("one-body") stops: if the neutralino (or gravitino) is light and is only 
slightly larger than mt, two-body decays t — > tx° in which the momentum of x° is 
very small can predominate. Kinematically, these are effectively "one-body decays" 
since the top carries all the momentum, and stop pair production can be extremely 

1 For theoretical models which motivate a closer look at SUSY with light third generation superpartners 
see e.g. [19, 20, 21, 22, 23, 24, 25]. 
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Figure 1: J£ T distribution in top and stop events, where we have considered stop decays to massless 
neutralinos. The rate is normalized to the number of events with two isolated leptons. 



difficult to separate from top pair production [39]. Furthermore, unlike compressed 
supersymmetry scenarios, the events do not become more distinctive when recoiling 
against an additional hard jet [40]. 

The stealth stop regime is the most challenging and can involve a large new physics cross 
section at the LHC. This regime is the focus of our current study. 

We illustrate the stealth regime in Fig. 1, which shows the missing transverse energy 
distribution for dileptonic events from top pairs and 200 GeV stop pairs (decaying as 
t — > tx°)- This is based on a simulation with cuts that we will describe in Sec. 4.3. The 
distributions for tops and stops are very similar, because in the rest frame of the stop, in 
the limit of small mass difference and massless x > the momentum of the decay products 
is ~ 5m = mi — mt- In the lab frame, the x° carries away invisible momentum of order 
7 5m, and for production of typical stop pairs the boost is not large. 

If a stop decays to a massless neutralino, the transition from the three-body regime 
to the stealth regime is not smooth. The three-body decay ends abruptly at m^ = mt, 
at which point two-body stealth decays dominate until the mass splitting becomes large 
enough that the decays are no longer stealthy. The case of a stop decay to a gravitino is 
slightly more subtle; the gravitino couples to SUSY breaking, leading to two extra powers 
of mi — mt phase-space suppression in the two-body decay rate. This allows the three- 
body regime to extend to somewhat higher masses, as illustrated in Fig. 2. (This plot 
and others throughout the paper rely on simulations performed with MadGraph 5 [41], 
as well as goldstino vertices we have implemented [42] using the UFO format [43]). The 
estimates in [8] show that current analyses have weakened sensitivity in the range mt < 
mj < 250 GeV, which we will take as our characterization of the stealth stop window. 
We review the current searches relevant for stops in Sec. 2, characterizing the extent to 
which they are simple top rate measurements in this window. Although more data will 
reduce the statistical errors on measurements of the top, both experimental systematics 



- 3 - 



Fraction of 3 -body Decays 




170 180 190 200 210 

M stop [GeV] 

Figure 2: Fraction of stop decays, t — > W + bx, which are three-body, as a function of the stop 
mass. More precisely, we are labeling a decay "three-body" when m(W + b) < m t — 3r t , and have 
taken the top quark mass to be 173 GeV. The neutral fermion \ is either the gravitino G or a 
massless bino B. In the gravitino case, three-body decays persist for larger stop masses, so the 
"maximally stealthy stop" is at masses nearer 200 GeV than 175 GeV. 
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Figure 3: NLO stop pair production cross section at the 8 TeV LHC, as reported by Prospino [45]. 
The vertical axis on the right shows the rate as a ratio to the tt rate. 

and theoretical uncertainties will remain. Measurements of the top are notoriously difficult 
(see, e.g., Ref. [44]), and so the more handles one has to constrain/discover stops, the 
better. 

Here we present a set of search strategies which can be used to constrain stops in this 
difficult region of parameter space. While we find no single smoking-gun signature for light 
stops, a few robust physical considerations can enhance the sensitivity of searches. It will 
be important to combine these considerations, because stops are rare. At the 7 TeV LHC, 
the top cross section is about 165 pb (increasing to about 230 pb with 8 TeV collisions [46]), 
while the stop cross section for = m t is only about one-sixth as large, and drops steeply 
at larger masses. (See Fig. 3 for NLO results; recent, more accurate, calculations of stop 
production may be found in [47].) Thus, finding stops by simply measuring the total rate 
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Stops near mt can increase the measured value of a t t by up to ~ 15%. However, (1) 
even at NNLL the uncertainties in a t i are ~ 10% and (2) theoretical ambiguities in 
the interpretation of the measured value of m t can further increase the uncertainty 
by a similar degree. 

While the LSP from stop decays contributes to the J5 T in an event, this effect is 
usually greatly diluted by the contributions from neutrinos (again, for m,- near 
mt). When this is not the case {e.g., with fully-hadronic top decays, or in the 
tails of m,T distributions) we expect the experimental systematics to be very chal- 
lenging. 

The shape of the Ay(t, t) distribution differs markedly between stop and top 
production. However, once one accounts for the small stop production rate this 
effect can be mimicked by QCD NLO uncertainties. 

Measurements of top spin correlation at the LHC are quite insensitive to higher 
order corrections, and show a clear difference between top and stop production. 
While we expect this measurement to be statistically limited, we conclude that 
this is one of the most robust channels to use in searching for light stops. 



Table 1: A summary of observables sensitive to the presence of stops above the SM di-top back- 
ground, and comments on the feasibility of employing these in searching for light stops. Note that 
when wc speak of tops in stop events we are referring to stops reconstructed as tops. 

of events passing top selection cuts is a challenge. 

The most important physical handle on stops in the stealthy regime is the character- 
istic that allows them to play their divergence-canceling role in SUSY: they are bosons, 
not fermions. In particular, because they are spin zero, they are produced without spin 
correlations and the decay of a stop and antistop are completely uncorrelated. Top pair 
production, on the other hand, involves spin correlation effects, linking the decay angles 
on the two sides of the event. Thus, a top pair sample enriched by stealthy stops should 
have correlations that are washed out by an amount related to the stop cross section. The 
effect is small, but it still can be used, and we elaborate on these ideas in Sec. 3. Even 
more important, it was shown in [48] that full matrix element spin-correlation measurement 
is stable with respect to NLO corrections, suffering from very mild systematic uncertain- 
ties. Stability with respect to NLO corrections distinguishes this measurement from other 
methods. 

Other effects that we use to discriminate between these two samples have to do with 
smallness of tt production compared to tt. Two effects conspire to suppress the production 
rate of stops compared to tops near the threshold. First, the process gg — > tt has t- and 
■u-channel singularities, which are regularized by mt- There is no analogous singularity 
in tt* production; therefore tt production is enhanced, and tt events tend to have larger 
rapidity gap between tops, relative to tt*. This is the most important stop rate suppression 
mechanism at the LHC. On the other hand, one can take advantage of the rapidity gap 
as a possible discriminator. We will elaborate on this potentially useful handle in sec- 



- 5 - 



tion 4, though we notice that this approach can suffer from unpleasantly large systematic 
uncertainties. Another mechanism which suppresses the stop production rate is a p-wave 
suppression in the s-channel stop production from a qq initial state, motivating a look at 
the higher regime, where the stop portion in the sample might be enhanced. This pro- 
cess is very important at the Tevatron, but less interesting at the LHC due to suppressed 
qq parton luminosities. We will further comment on this in Appendix B. Although we do 
not show a single channel allowing for stealthy stop discovery, we claim that a combination 
of the above mentioned tools can give us more than 2a sensitivity to a stealthy stop by the 
end of y/s = 8 TeV run. We summarize the expected sensitivity in Table 1. 

Our paper is organized as follows. In the next section we review the current searches 
for SUSY and further discuss the features of the stealth regime. In section 3 we discuss 
discrimination between tops and stops through spin correlation. In section 4 the rapidity 
gap and its practical use are discussed. (In Appendix A we briefly show how these tools 
can be combined to achieve better sensitivity.) Finally in Sec. 5 we conclude. 

2. Current LHC Searches 

Several existing LHC studies have some bearing on the question of light stops, and will 
become increasingly applicable with more data. In this section, we will survey them, argu- 
ing that currently the main discriminating variables being used are the top cross section, 
missing Er, and more sophisticated proxies for missing Et (like transverse mass). 

Because kinematics in the stealth regime resembles SM ti production, measurements 
of the top quark cross section potentially constrain stop production. One difficulty with 
attempting to find new physics simply by measuring the top cross section relative to its 
Standard Model value is that systematic uncertainties matter. As the top quark pole mass 
varies by about 5 GeV, the total cross section for ti production varies by about 15% [49]. In 
order to make a convincing measurement of new physics, then, one needs both an accurate 
measurement of the total cross section and an accurate kinematic measurement of the top 
pole mass. Furthermore, the theoretical prediction of cross section given mass must be 
accurate enough to resolve a discrepancy. Current theoretical calculations of cross sections 
have 10% errors from a combination of scale variation, a s , and PDF uncertainties [46, 49]. 
Measurements of the top mass from kinematics can be fraught with difficulties over the 
precise definition of "top mass," with a common claim being that experiments have tra- 
ditionally measured the ill-defined Pythia mass rather than the pole mass. Even the pole 
mass of the top is not well-defined, requiring a specification of the treatment of IR ambi- 
guities due to confinement in QCD [50] . Inferences of the top quark pole mass from total 
cross section (using NNLO theory calculations [51]) at D0 [52], CMS [53], and ATLAS [54] 
yield 167.5+4;g, 17Q.3±l% and 166.4±^;| GeV, while the latest Tevatron combination for the 
top mass inferred from kinematics is 173.2 ± 0.9 GeV. A CMS cross section combination 
measured 165.8 ± 2.2 (stat.) ± 10.6 (syst.) ± 7.8 (lumi.) pb [55], whereas an ATLAS cross 
section combination measured 177 ± 3 (stat.)ly (syst.) ± 7 (lumi.) pb [56]. Taking all of 
this into account, it is clear that any apparent 15% deviation in the top cross section would 
be much more plausibly attributed to uncertainties in a s , PDFs, nit, luminosity, or any 
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Figure 4: Estimated relative efficiency of various existing studies for tops and for stops decaying to 
(on- or off-shell) top and massless bino. Notice that efficiencies for stops do not become significantly 
larger than for tops until the stop mass is above about 210 GeV; the regime below this is the 
especially challenging case of stealthy stops. At higher masses, stops are easier to separate from 
tops, although rates become small. (At 210 GeV, the Prospino estimate shown in Figure 3 is 

Cstop ~ 0.06cr t op.) 



number of other systematic sources than to new physics. A claim of stop discovery must 
rely on kinematic and angular distributions, not simply rates. 

Stops could also be constrained by various new physics searches, although these cur- 
rently set weak limits, as discussed in [8, 9, 10]. We have computed the efficiency for stop 
pair production events to pass the cuts of some of the existing searches, and plotted its 
ratio to the efficiency for SM ti events in Figure 4 as a function of the stop mass. (The sim- 
ulation makes use of Pythia [57, 58] and Fast Jet [59, 60].) As representative examples, we 
have plotted the medium cuts of the CMS all-hadronic search [4], the 250 — 350 bin of 
the CMS lepton projection analysis [61], the high-MET selection of the CMS opposite-sign 
dileptons analysis [62] , and the ATLAS ti + MET analysis [63] . The estimate of the reach 
from these searches presented in [8] showed a possible limit near the top mass from the 
CMS £ + jets + MET search [61], and a hope for future exclusions above 250 GeV from the 
ATLAS it + MET search [63]. Moreover, the estimates of [9] showed a borderline exclusion 
of a single stop with a mass around 300 GeV by the ATLAS search [63] . The discrepancies 
in estimates between Refs. [8] and [9] are very minor and can possibly be explained by 
details of the simulation and finally resolved by an updated experimental search. 

There are two interesting features to note in Figure 4. The first is that, for stops 
heavier than about 240 GeV, it is straightforward to build a search that has much higher 
efficiency for identifying stop events than top events. In particular, the ATLAS ti + MET 
search does very well by focusing on the semileptonic channel and requiring a transverse 
mass cut mx(^, E T ) > 150 GeV. For semileptonic top events, one expects this transverse 
mass variable to be dominated by the neutrino from a W decay and typically bounded 
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Figure 5: Fraction of events with transverse mass m-r in a given range and passing jet and lepton 
selection criteria from the ATLAS tt + MET study [63] . Notice that stop events have a larger tail 
above My/ than top events do, even though the stop mass is barely above the top mass. This is 
due to three body decays t — > bW + Xi- In the large mj bins, the efficiency for stops is about twice 
the efficiency of tops. 



above by 80 GeV. A significant background from dileptonic top events where one lepton is 
lost remains, but this too can be rejected by a clever variation on Mt2 [30]. Similarly, one 
could search in the dileptonic channel using Mt2 constructed with two leptons and missing 
Et to reject ti backgrounds, again because it has an edge at the W mass [37, 8]. Thus, 
the lesson to take away from the dramatic rise in the blue dot-dashed curve in Figure 4 is 
that cutting on kinematic variables that are bounded above in the background can be very 
effective, and generalizations of these variables will be an interesting tool to apply in the 
future. 

The second feature of Figure 4 that is interesting is the stealth regime, roughly 
175 GeV < M s top < 220 GeV (the upper limit of this regime is somewhat subjective). 
One might have expected that in this regime, the efficiency for stops would be almost 
identical to that for tops. We see in the figure that this isn't true, even for M stop < 180 
GeV; the efficiency of the selection cuts for stops very near the top mass is about 30% 
to 60% higher than that for tops, depending on the precise stop mass and the study con- 
sidered. The reason is that, in the stealth regime, the bulk of the kinematic distributions 
for tops and stops are similar but the tails can still be significantly different. All of the 
new physics searches we have plotted involve large missing Et requirements, which are 
more easily satisfied in the stop events than top events. In particular, for stops barely 
above the top mass, we have checked that the enhancement of the stop efficiency with the 
ATLAS tt + MET selection cuts is explained by events in which at least one stop decay 
is 3-body, with m{bW) <C mt, allowing the bino to carry away significantly more missing 
momentum than stealth kinematics would predict. The resulting difference in transverse 
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mass distributions is shown in Figure 5. 2 At higher stop masses, 3-body decays become 
less important, but tails of kinematic distributions in which the bino momenta are as large 
as possible play a key role in the enhanced efficiency Similar results were presented for 
Tevatron searches in Fig. 9 of Ref. [14] , which focused on the gravitino case in which even 
more 3-body decays are present and the stop to top ratio can be even higher near the top 
mass. The lesson is that even in the stealth regime, missing Ej~ variables, especially clean 
ones like Mt or Mti that can be designed to have edges in the background, can still play 
a useful role in constraining stops. However, 30% to 60% gains in S/B when starting with 
a signal an order of magnitude below the background leave much to be desired. A further 
difficulty is that, at these low masses, stop events will contaminate control regions of many 
studies, making it more difficult to infer the presence of a signal. We expect that in this 
regime missing energy variables are best used as part of a larger toolkit that exploits other 
differences in top and stop kinematics. In the rest of the paper, we will aim to build that 
toolkit. 

Before continuing, let us make a simple estimate of the number of events that we 
can expect to be able to work with. With 20 fb -1 and an NLO tt rate of about 200 pb, 
and estimating a typical selection efficiency of about 20%, we expect to have (ignoring 
taus) about 40,000 dileptonic top events, 200,000 semileptonic top events, and 350,000 
all-hadronic events. In this paper we will concentrate on signals with leptons, to avoid 
the potentially large QCD backgrounds in the all-hadronic channel. Note that if we knew 
the dileptonic top rate perfectly, a sample of 4,000 stops on top of 40,000 tops would have 
S/y/B ~ 20. Of course, in reality systematic uncertainties overwhelm a simple cut-and- 
count method, but this number shows that our task is not hopeless, provided we can find 
shape variables with convincing differences. 

3. Spin Correlations 
3.1 Spin correlation 

In the SM tops, being fermions, are pair produced with correlated polarizations that man- 
ifest in certain measurable angular variables [64, 65, 66, 67, 68, 69, 70]. For instance, at 
the LHC, where top production is dominated by gg — > ti, when one top is left-handed 
the other tends to be as well, and similarly for the right-handed case (see Ref. [71] for an 
overview, and [72] for a study in the context of new physics). However, tops produced via 
stop decays have no such correlation because the scalar stops can not preserve any spin 
information. Therefore we can take advantage of techniques that were previously devel- 
oped to measure spin correlation in ti pairs. In our case the goal is much more challenging. 
We do not compare the hypothesis that the top and antitop have Standard Model spin 
correlation with the hypothesis that they are completely uncorrelated, but rather with the 

2 The spill-out of ti events beyond rnrii, J5 T ) = niw is caused by dileptonic events (with one lepton 
being lost) or by events with a r (either leptonic or hadronic). Another possible source of these events can 
possibly be detector smearing, which is not captured here, since we do not run detector simulation. We 
expect that the latter effect is minor. 
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Figure 6: Acf)(£ + ,£~) for ti production, tt* production, and ti production with spin correlation 
turned off (i.e., the differential rates for production and decay are factorized and we randomize the 
top helicities in between). Notice that, from the point of view of this variable, stops are essentially 
the same as spin-uncorrelated tops. Also, polarization effects are small, as left- and right-handed 
stops have the same distribution. 



hypothesis that a spin-correlated ti sample has O(10%) contamination from scalar events, 
which approximately look like spin-uncorrelated tops. 3 

When the LSPs are soft, stop events are similar to top pair events without correlation. 
This is illustrated in Figure 6, which shows one distribution, Ac/)(£ + , £~), which is sensitive 
to spin correlations, and for which stops look like tops with spin correlation turned off. We 
have calculated the observable for tops with MC@NLO [75, 76] at parton level, and checked 
that corrections from varying the top mass and the renormalization and factorization scales 
are small relative to the shift that would arise from adding a sample of stops to the tops. 
This observable has been studied by ATLAS to probe the existence of spin correlations in 
ti production [77] , with the most recent update achieving 5a significance for the existence 
of nonzero correlation effects [78]. 

In order to confirm the SM top pair spin correlation Ref. [48] proposed a method using 
full matrix elements with and without spin correlation. This method has been implemented 
experimentally in Tevatron searches [79, 80], which observed evidence for spin correlation 
in both the dileptonic and semileptonic channels. Since many more top events are produced 
at the LHC than at the Tevatron, we are expecting a more precise measurement at the 
LHC of the ti spin correlation. Any deviation from the SM prediction will be a sign of 
new physics. In the presence of light stops, we will observe a mixture of correlated and 
uncorrelated top pairs. In the following, we discuss the use of the matrix element method 
in stop searches. We concentrate on the dileptonic channel in the following discussion. 

3 One other effect that could play a role in angular distributions turns out to be unimportant for us: the 
stop can be mostly right-handed or mostly left-handed (as some theoretical models predict; see e.g. [24]), 
and so the tops coming from the stop decays can be polarized. While it can be an appreciable effect if the 
mass splitting between top and stop is large [73, 74], it is a small effect in the stealthy regime, as we have 
checked explicitly. Hence, we will not discuss it further. 
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We have also checked the semileptonic channel, which gives a less significant effect due to 
combinatorial uncertainties. 

For a dedicated stop search, it would be optimal to use directly the stop matrix element, 
which involves both the matrix element for the 2 — > 2 process and the one for subsequent 
particle decays. However, as we will discuss further in Section 4, the differential cross 
section for the 2 — > 2 process is subject to large theoretical uncertainties. On the other 
hand, it was observed in Ref. [48] that the spin correlation information for ti production 
is stable against NLO corrections at the LHC. Therefore, it is illuminating to examine the 
spin correlation alone, without invoking explicitly the stop pair matrix element. For this 
purpose, we use the matrix elements for top pairs with and without correlations, as they 
were given in [70] . (The spin uncorrelated matrix element was derived from the assumption 
that top decays are completely spherically symmetric in the rest frame of the top). This is 
also a generic approach that allows us to find signs of new physics in ti correlations without 
specifying the underlying theory. 

3.2 Implementation in stop searches 

In the dileptonic channel, we reconstruct the events and calculate the likelihood distri- 
butions as follows. Top and stop events are generated at parton level (including spin 
correlations) with MadGraph [41] and showered with Pythia [57, 58]. Jets are clustered 
with Fast Jet using the anti-fc^ algorithm with radius 0.4 [59, 60], and we apply simple 
isolation requirements to define leptons, which are identical to the isolation criteria of [81]. 

1. Events are required to pass the following kinematic cuts, which closely follow (but 
are not identical to) the event selection used by ATLAS in Ref. [81]. 

• At least two jets with pt > 25 GeV and \rj\ < 2.5, one or two of which must be 
b-tagged. 

• Two opposite-sign leptons with pj- > 20 GeV and \rj\ < 2.5. 

• In the e + e~ and fJ> + n~ channels, the invariant mass of the two leptons must 
satisfy m# > 15 GeV and \mg£ — Mz\ < 10 GeV. In addition we demand 
E T > 40 GeV. 

• In the e/i channel, no cut is applied on ma or E T . Instead, a cut on Ht 
(defined as the scalar sum of all selected charged leptons and jets) is applied: 
H T > 130 GeV. 

2. We apply a flat b-tagging efficiency of 65% (this is conservative relative to the 80% 
quoted in Ref. [81]). For events with only one b-jet, we consider all non-b jets with 
Pt > 25 GeV as a candidate for the other b-jet. Then for each combination of two 
b-jets and two leptons, we can use the W and top mass shell constraints to solve for 
the momenta of the two neutrinos from W decays, assuming they are the only missing 
particles in the event. There could be 0, 2 or 4 real solutions. We discard events 
without real solutions. The final efficiency is 16.8% for tops and 15.6% for stops. All 
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Figure 7: The likelihood variable TZ. Left: parton level; right: jet level. 



solutions from all combinations are taken into account in the following calculation. 
Following Ref. [48], we define the probability distribution for a given event as 



(3.1) 



where H = {corr,uncorr} denotes the hypothesis of correlated or uncorrelated tops, 
Mh is a normalization factor, J a is the Jacobian which appears when integrating over 
the neutrino momenta, fi and fj are the parton distribution functions for the two 
incoming partons, and M 1 ^ is the leading order matrix element. The sum over a 
is the sum of all combinations and solutions. The sum over i, j is over all possible 
initial partons, gg, uu and dd. For each event, we calculate both P C orr and P U ncorr 
and define the variable TZ as 



n 



Pc, 



(3.2) 



The variable TZ is the likelihood for an event to be a correlated top pair. The TZ 
distributions for top event and stop events are shown in Fig. 7. For comparison, we 
also show the TZ distribution at the parton level. 

3. Given the TZ distributions, we can also follow Ref. [48] to calculate the log likelihood 
ratio L to discriminate between two hypotheses with a set of N events. Here, L is 
defined as 

L = 21n(A/£ t ~), (3.3) 



where 



N 



(3.4) 



and pK^JZi) is the probability density read from Fig. 7. we choose the number of top 
pair events corresponding to 20 fb _1 at 8TeV and compare it with the same number 
of events, but with top and stop mixed in the ratio of 12:1, corresponding to the 
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Log Likelihood Ratio L, Parton Level 



Log Likelihood Ratio L, Jet Level 




Figure 8: The log likelihood ratio L. Each point on the curve corresponds to a pseudoexperiment 
with 32.8k events. Note in particular that each top plus stop pseudoexperiment is normalized to 
the same number of events as each top pseudoexperiment. Left: parton level; right: jet level. 

ratio in the cross sections. After kinematic cuts and reconstruction, we are left with 
32.8k top events for the pure samples, and 30.4k top events and 2.4k stop events for 
the mixed sample. We generate 10k pseudo-experiments corresponding to the two 
cases and the resulting L distributions are shown in Fig. 8. For comparison, we also 
generate 10k pseudo-experiments at the parton level using the same number of events 
as the jet level (after reconstruction). From the jet level result in Fig. 8, we estimate 
that on average a light stop of 200 GeV can be excluded at 95% confidence level. 

4. Rapidity Gaps 

4.1 Top and stop production amplitudes 



Initial state 


it 


tt* 


99 


68 pb 


11 pb 


qq 


23 pb 


1.6 pb 



Table 2: LO cross sections for top and stop production processes in MadGraph, for a 180 GeV 
stop. Notice, in particular, the tininess of stop production from qq due to the p-wave threshold, as 
well as the significant suppression of stop relative to top production even from gluons. 

As illustrated in Figure 3, the pair production rate of stops, even with a mass equal to 
the top mass, is well below the pair production rate of tops. Delving into the differences 
in stop and top production processes will shed some light on this, and also suggest a set 
of variables that can be useful in discriminating top from stop events. The rates for top 
and stop production at leading order, from either qq or gg initial states, are shown in 
Table 2. The most striking fact is the smallness of qq — > tt* , which is explained by p-wave 
suppression: the stops in the final state need to carry angular momentum. Since they have 
no spin, this implies that they are produced in a p-wave with a rate oc /3 3 near threshold. 
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However, no such simple argument explains the ratio between stop and top production 
initiated by gluons. Naively, we might expect that the top and stop are both color triplets, 
so that for energies far above threshold, their production rates will be identical up to factors 
counting degrees of freedom, so that the rate for stops is asymptotically half that of tops. 
Table 2 makes it clear that this misses some aspect of the physics. 

We can get some insight by considering the behavior of the differential cross section in 
the massless limit. For stops, there is a well-defined total rate [47]: 

a(gg -> M*) ^ s>>fn (4.1) 
On the other hand, for massless quarks, one has [82]: 

which doesn't have a well-defined integral over phase space due to the t- and u-channel 
poles. Because the top is massive, we expect that this integral will be regulated and that 
the result will be a log enhancement in the rate for forward top production in the 
parton center-of- momentum frame. This is a very real physical difference between the 
production of top quarks and scalar top quarks, which we should try to exploit. It is easy 
to see by considering the even simpler case of scalar QED versus QED, where the simplest 
MHV amplitudes with the right little group properties are easily written down and give 
the correct answers: 

A tree (l+,2-,3 ,4 ) = ie 2 ^^] = e 2 x phase (4.3) 

, treen+ _ _ ^_,„ 2 [14](2 3) _ _ 2 pa 



A tree (1 + 2 - 3 - )4 -p = ^ = ^ _ x ^ (44) 

The latter case corresponds to the familiar splitting function ameliorating the pole in a 
t-channel diagram to the square root of a pole in the amplitude. In the scalar case, this 
pole is completely absent. The two results are, in fact, related by a SUSY Ward identity. 

4.2 Parton-level distributions and systematics 

Our intuition from the limits of massless particles is useless if it does not carry over to 
a fact about physical, massive tops and stops, but it does. The distribution of rapidity 
gaps between produced tops and stops is shown in Figure 9. The distributions are clearly 
different. However, we should keep in mind the fundamental fact about stop rates: in 
a sample of candidate top events, we are usually looking for at most 10% of the sample 
composed of stops, so even a 50% difference in the shape of pure top and stop distributions 
will become a 5% effect in the combined sample. We must investigate the robustness of our 
Monte Carlo predictions of the shape of the top rapidity gap distribution, to understand 
whether such an effect can ever be measured. We will begin by assessing this at parton level, 
ignoring subtleties associate with jets and detectors. Making the case that the difference 
is observable at this level is a necessary but not sufficient precondition for claiming that it 
can be measured at the LHC. 
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Rapidity Gap 



F 0.15 



Stops (220 GeV) 
Stops (180 GeV) 
Tops (MC@NLO) 

Tops, H = 0.5 //Default 
Tops. H = 2.0 //Default 




Ay 



Figure 9: Ay distribution between top and antitop for ti production (orange) and for a 180 GeV 
scalar top (blue), both normalized to 1. Stops are more likely to be produced at small rapidity 
difference because the t- and w-channel poles in fermionic quark production from gg are absent in 
scalar quark production. Three curves are shown for tops, corresponding to different choices of 
renormalization and factorization scales in MC@NLO. 



We have generated large samples of ti events using MC@NLO [75, 76] to assess the 
systematic uncertainties. We have varied both the factorization and renormalization scales, 
and the top quark mass, to analyze the effects on the shape of the Ay distribution. The 
fractional change in shape, bin-by-bin, is plotted in Figure 10. Here we have computed the 
binned Ay shape for different scale choices, normalized the shapes to unit area, and given 
the size of the shift in each bin relative to MCONLO with default choices. We do not show 
variations of the top mass in the plot because we found the shape to be insensitive to it. We 
also show in Figure 10 the change in the default shape from tops when 12% of the sample is 
composed of 180 GeV stops instead of tops. Although the shape change can be significant, 
we see that much of the effect can be mimicked by increasing the renormalization scale. 
This compares only the shape, not the normalization. Doubling the renormalization scale 
also lowers the cross section by about 13%, whereas the presence of 180 GeV stops increases 
it by about 12%. However, the lowered cross section can be compensated to some extent if 
the top mass is lower, and the total top rate has at least a 10% theoretical uncertainty, as 
we reviewed in Section 2. The bottom line is that the rapidity distribution carries definite 
physical information, but to use it we must be cautious about systematic uncertainties in 
our understanding of top quark production. Continuing progress in understanding of tops 
at NNLO [83] could play a role in reducing these uncertainties. 

Parton distribution functions could also be a source of uncertainty. We have not 
systematically explored this, but as one illustrative example, switching MC@NLO's choice 
of PDF set from the default CTEQ 6M to an Alekhin NLO FFN set produced a curve 
similar to the \x = 0.5 //Default curve in Figure 10. Understanding PDF uncertainties would 
clearly be one component of getting Standard Model ti predictions under enough control 
to make statements about the presence of new physics with any confidence. 
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Figure 10: Relative fractional change in the fraction of events in each bin of Ay. The comparison 
is to MC@NLO run with default renormalization and factorization scales. The dotted dark orange 
curve is the result of halving these scales. The dashed light orange curve doubles the scales. The 
blue curve is the result of adding an admixture of 180 GeV stops (12% of the events, based on the 
relative NLO cross sections). The concern is that the effect of stops may be mimicked by a larger 
renormalization scale. 



So far we have been discussing only inclusive parton-level quantities. Cuts on kinematic 
variables can help to draw out the differences between stops and tops. In Figure 9, both 
stops and tops peak at Ay = 0. This is because both are dominantly produced near 
threshold, where one has Ay = by definition. In fact, one can easily see that as a 
function of the center-of-mass energy of the parton collision, there is a bound on Ay 
in tt events: 

Ay < log v * (4.5) 

s — \J s — 4m| 

The same consideration applies to the ti subsystem of a stop event. Figure 11 shows where 
events lie in the plane of (M t j, Ay); tops are seen to prefer larger Ay at larger M t p This is 
not surprising since the t-/u-channel singularity is regularized by m t , and as s grows, tops 
approaches a massless quark limit. Thus, a cut on s will allow us to isolate the regime where 
there can, in principle, be large differences between top and stop events. However, there is 
a caveat: a hard cut on s will make the difference in distributions clear, but runs the risk of 
diluting our samples enough that statistical uncertainties overwhelm the systematics and 
prevent us from drawing a clean conclusion. 

As the bottom right-hand plot of Figure 11 shows, the mass cut also does not eliminate 
the systematic uncertainty: varying the factorization and renormalization scales upward 
by a factor of 2 continues to mimic much of the effect of adding stops to the top sample. It 
may be that a variable can be constructed that will capture similar physics while reducing 
systematic uncertainties. We have studied the Collins-Soper angle, which corrects the angle 
of the top quark to the beam direction to minimize the effects of ISR [84] , as one possibility, 
but found that it was not an improvement. 
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Figure 11: Top: correlation between the tt invariant mass (which for tops characterizes the center- 
of-mass energy of the parton collision) and the value of Ay. Notice that at larger M t i , the tops 
are more strongly peaked at larger Ay than the stops. The dashed curves correspond to Eq. 4.5; 
some stop events spill over the line, because the decays are off-shell (in which case we have replaced 
Ptop with pb +Pw m the definitions of M t i and Ay). Bottom: the analogues of Figures 9 and 10 
after imposing a cut M t j > 500 GeV. The mass cut leads to a more dramatic shape difference, 
but the systematic effect of varying the renormalization and factorization scales can still mimic the 
signal. 

4.3 Jet-level distributions and practical use 

In spite of the systematic uncertainties discussed in the previous subsection, it can be 
useful to exploit the rapidity gap as one more variable that can potentially discriminate 
between a pure top sample and a stop-enriched one. The most straightforward way would 
be reconstructing the entire event and using the rapidity gap between the tops. Although 
possible, we find that this approach is not ideal. It is clear that some stop events, where 
the neutralinos contribute an appreciable portion of the JE T , will not pass reconstruction 
criteria. In this way, we lose the most pronounced non-top-like events. Nonetheless, ap- 
plying the same reconstruction as in Section 3, we find 2 a significance from Ay(t, t) alone. 
(The combination with spin correlations is discussed in Appendix A.) 

To avoid this undesired loss of valuable events, we find another variable which is closely 
tied to the rapidity gap between the tops. It turns out that the rapidity gap between the 
leptons in the dileptonic events closely follows the rapidity gap between the tops. The 
rapidity gap between the leptons also grows in a ti sample as m t i increases. 

We also do not use the top invariant mass in our study, because it would force us 
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Figure 12: Rapidity gap distribution for tops and stops, normalized to the rate of events passing 
the cuts. The distribution on the LH side includes all the events which pass simple ti acceptance 
cuts. The distribution on the right shows only those events which have a visible mass higher than 
375 GeV, as explained in the text. 



Lepton Rapidity Gap 




12 3 4 



Figure 13: Rapidity gap between the leptons after the cut on the "visible mass" as explained in 
the text. 



again to reconstruct the event. One might also consider reconstructing the invariant mass 
of all the objects in the event but this would introduce an undesired sensitivity to the 
ISR/FSR, deteriorating the correlation with m t i. Instead we choose a different quantity 
which closely mimics the behavior of the invariant mass and can be easily calculated without 
full event reconstruction. This quantity is an invariant mass of the leading visible objects 
in a dileptonic ti event. If both b-jets are tagged, we simply construct an invariant mass 
of two leptons and two b-jets. If only one b-jet is tagged, we construct an invariant mass 
of the leptons, b-jet and the non-tagged jet with the highest pt in the event. 

We illustrate our approach in Fig. 12. To build these plots we select the following 
events (these selection criteria are very similar to [85, 81]): 

• Events with precisely two isolated leptons. The pr of each lepton must exceed 20 
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GeV and \rji\ < 2.5. 4 

• Each event should have two or more jets, pr(j) > 25 GeV, \rj(j)\ < 3.0. 

• At least one jet should be b-tagged; we assume a flat tagging rate of 65%. 

• E T > 35 GeV 

• Discard the events with OSSF leptons and invariant mass 76 GeV < mu < 106 GeV 
(Z-window) 

• In the second sample we demand the invariant mass of the leading visible objects as 
explained before to be higher than 375 GeV. 

We also plot in Fig. 13 an event distribution in a mixed top-stop sample and compare 
it to pure top. Notice that our cuts slightly favor stops. The acceptance for stops is 
26.5%, while the acceptance for tops is 25.5%. This happens because the stop sample 
tends to have slightly higher E T than tops, even though this tendency is not sufficient to 
discriminate between them alone. This is the feature we illustrated early in the paper in 
Fig. 1. Note that the excess of events in in the bins with |Ar/| < 1 exceeds 10%, and maybe 
in combination with other distributions can be a good variable to discover or exclude the 
light stop. 

Finally we briefly discuss (lack of) contamination of these samples with other back- 
grounds. First, one might worry about [Z — > T£T£) + jj, but it was shown in [85] that this 
background becomes negligible when two or more jets are required. The second important 
background is dileptonic DY production with jets. In this background the missing Ej^ usu- 
ally comes from mismeasurement. It was also shown in [85] that a modest cut on E T > 35 
GeV and the requirement of two jets render this background subdominant. Although this 
search was performed for ^/s = 7 TeV, there is no reason to believe that the results of a 
^/s = 8 TeV search will be very different. Hence, have we neglected these backgrounds 
throughout the paper. 

5. Discussion 

Naturalness could still manifest itself through a variety of possible signatures. A heavy 
stop, for instance, could decay to a lighter stop to produce tiZ + E T [88]. The sbottom 
provides its own novel and possibly easier-to-find signatures [89, 90, 91, 92]. Higgsinos must 
be light for tree-level naturalness, and if they are the LSP or NLSP they can lead to more 
easily-observed stop decays [10, 93, 94]. Furthermore, deviations in Higgs boson production 
and decay rates from the Standard Model could also provide hints that naturalness is at 
work. Simultaneous pursuit of all of these signatures is necessary for a complete picture. 

Here we have focused on the "stealth stop" regime in which missing momentum is 
relatively small, and argued that rapidity gap and spin correlation observables can help to 

4 Even softer isolated muons were allowed in the search [85], if they were the softest lepton in the event, 
so in this sense our results can be further improved. 
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build the case for a new physics signal. For less stealthy stops, we expect variants on missing 
Et (especially transverse mass variables and their generalizations, which are bounded in 
tt background) to be relatively successful at digging stop signals out of top backgrounds. 
Even in that case, however, we would emphasize that the rapidity gap and spin correlation 
observables are diagnostic of the production of scalar particles. They may be less essential 
for discovering a signal, but could play a key role in establishing the nature of the signal. In 
particular, the distinct rapidity distributions of stops as opposed to fermionic top partners 
gives a new handle on spin determination. These measurements could be made with the 
next year's worth of data, and so we eagerly await the LHC's verdict on light stops. 
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A. Multivariate Analyses 

We have seen that there are various differences in distributions in top and stop events 
that can be understood on physical grounds, mostly related to the angular momentum 
properties of the final states. Although one could try to construct a stop search using any 
one of them, or using various cuts designed to enhance signal to background, we expect 
that the most flexible and powerful approach is a multivariate analysis that uses all of the 
information. First, we would like to check that the variables we have used so far are not 
highly correlated. 

One way to check for correlations is to bin one variable and plot the other; we take 
this approach in Figure 14, which shows that selecting particular ranges of A(p(£ + ,£~) 
makes relatively minor changes in the shape of the Ay(t, t) distribution. Another check 
is to compare the two-dimensional distribution of simulated points in the (Ay, A(j>) plane 
with the product of two one-dimensional distributions. Comparing binned samples of ti 
events (10 bins in A(f> by 12 bins in Ay), we find that in any given bin, the ratio of the two- 
dimensional probability to the product of the two one-dimensional probabilities is always 
in the range 0.92 to 1.08. Hence, we expect that the discussions in Sec. 3 and Sec. 4 are 
approximately orthogonal, and we can achieve better significance by combining the two 
approaches. (Of course, one would want to be sure systematic uncertainties on Ay are 
under control before taking this approach too seriously.) 
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Azimuthal Angle Dependence of Rapidity Gap 




Figure 14: Rapidity gap between top and antitop, as in Figure 9, with the samples binned based on 
A(f>(£ + ,£~). We see that the distributions are not strongly correlated, in the sense that restricting 
to a subset of A</> values makes little difference in the shape of the Ay(t, t) curve. 



Along these lines, we consider a simple multivariate method by taking the product of 
probabilities defined from one dimensional distributions [95, 96]. For a given variable x, we 
obtain the signal and background distributions in two histograms with the same binning 
and normalized to the same area. Then we define the probability of an event in bin i being 
a signal event as 

rv s + n b 

Correspondingly, the event has a probability (1 — ) being a background event. For 
multiple variables, the total probability is defined as 

T> = TlxPs /a 2 ^ 

11,/'.; • psy 1 ' ' 

Then we can treat V s as a single variable and repeat the likelihood test as in Sec. 3 for 
the two hypotheses and obtain the L distributions. In Fig. 15, we show the L distributions 
by combining the spin correlation variable 1Z and the reconstructed rapidity gap Ay(tt), 
for 20 fb _1 data. The reconstruction procedure is the same as in Sec. 3. Note that 
in this approach, the correlations among different variables are not taken into account. 
Nonetheless, as for the (Ay, A(f>) distributions, the correlations between the two variables 
are small. Therefore we obtain significantly better discriminating power than using each 
individual variable: the two distributions are separated by ~ 3a in Fig. 15, while the two 
variables alone each give us ~ 2a significance. 



B. Boosted Stops 

As discussed in Sec. 2, while measurements of the top production rate are in principle sen- 
sitive to stop production, difficulties arise because of challenging systematic uncertainties. 
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Two-Variable Log Likelihood Ratio L 




Figure 15: The log likelihood ratio L by combining two variables: R and Ay t j. Each point on the 
curve corresponds to a pseudoexperiment with 32.8k events. 

Indeed, even at NNLL theoretical uncertainties in the top cross section are comparable to 
the size of the contributions we expect from stops. 



pj of Tops and Stops p T of Visible Decay Products 
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Figure 16: Parton-level px distributions of tops and stops (left) and of the corresponding visible 
decay products (right). These distributions are normalized so that the top and stop share the same 
inclusive rate. Note that here we have taken mj = 200 GeV. 

However, the physics which suppresses the stop production rate relative to that of 
tops (i.e. phase space suppression from the larger stop mass, p-wave suppression, and the 
absence of t and u-channel poles - see Sec. 4) tends to have a diminished effect at higher 
Pt's. This can be seen by comparing normalized stop/top pt distributions, as shown on the 
left hand side of Fig. 16. One might therefore expect that through the use of top-tagging 
techniques (see Ref. [97] and references contained therein) one could observe a significant 
increase in top production at high px coming from the decay of boosted stops. 

Alas, this is not the case for the stealth region we have considered. The distribution 
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of top/stop pr's is steeply falling, and at high boosts the transverse momentum carried 
away by the LSP is enough to shift the distribution in px of the visible decay products 
downward enough to largely cancel any hoped-for benefits (the fractional change in px is 
roughly Am^ t /mt). See the right-hand side of Fig. 16. For this reason we do not include a 
more detailed study of boosted stops, although they can be useful outside of the stealthy 
regime. 
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